In this article a new programming paradigm is discussed: naturalistic programming. 
Naturalistic programming means writing computer programs with the help of natural 
language. The authors are convinced that contemporary programming techniques have ... 

The suitability of different parsing methods for different languages is an important topic 
in syntactic parsing. Especially lesser-studied languages, typologically different from the 
languages for which methods have originally been developed, pose interesting ... 

This article addresses the development of statistical models for phrase-based machine 
translation (MT) which extend a popular word-alignment model proposed by IBM in the 
early 90s. A novel decoding algorithm is directly derived from the optimization ... 

The Greenstone Digital Library Software has helped spread the practical impact of digital 
library technology throughout the world, with particular emphasis on developing 
countries. As Greenstone enters its second decade, this article takes a retrospective ... 

In this paper, we describe a system that recognises place names in natural language 
text and produces geographic maps and animations showing the geographical coverage 
of texts about a certain subject as it changes over time. As the system is built to ... 

Search engine logs are an emerging new type of data that offers interesting 
opportunities for data mining. Existing work on mining such data has mostly attempted 
to discover knowledge at the level of queries (e.g., query clusters). In this paper, we ... 

As language data and associated technologies proliferate and as the language resources 
community rapidly expands, it has become difficult to locate and reuse existing 
resources. Are there any lexical resources for such-and-such a language? What tool ... 

In cross-language information retrieval (CLIR), novel or non-standard expressions, 
technical terminology, or rare proper nouns can be seen as noise when they appear in 
queries or in the target collection. This kind of vocabulary is often out-of-vocabulary ... 

The question of whether metonymy carries across languages has always been interesting 
for language representation and processing. Until now, attempts to answer this question 
have always been based on small-scale analyses. With the advent of EuroWordNet ... 

Previous works on question classification are based on complex natural language 
processing techniques: named entity extractors, parsers, chunkers, etc. While these 
approaches have proven to be effective, they have the disadvantage of being targeted 
to ... 

We report on the Parallel Grammar (ParGram) project which uses the XLE parser and 
grammar development platform for six languages: English, French, German, Japanese, 
Norwegian, and Urdu. 

Studies of three prototype Web search portals---in Chinese, Spanish, and Arabic---reveal 
how to best support non-English Web searching. 

An N-gram-based language, script, and encoding scheme-detection method is introduced 
in this article. The method detects language, script, and encoding schemes using a 
target text document encoded by computer by checking how many byte sequences of 